Yuzhuo Bai

InFi-Check: Interpretable and Fine-Grained Fact-Checking of LLMs

Jan 10, 2026

MEIC-DT: Memory-Efficient Incremental Clustering for Long-Text Coreference Resolution with Dual-Threshold Constraints

Dec 31, 2025

FaithLens: Detecting and Explaining Faithfulness Hallucination

Dec 23, 2025

IROTE: Human-like Traits Elicitation of Large Language Model via In-Context Self-Reflective Optimization

Aug 12, 2025

MiniCPM4: Ultra-Efficient LLMs on End Devices

Jun 09, 2025

Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning

May 22, 2025

GLTW: Joint Improved Graph Transformer and LLM via Three-Word Language for Knowledge Graph Completion

Feb 17, 2025

Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering

Feb 11, 2025

Value Compass Leaderboard: A Platform for Fundamental and Validated Evaluation of LLMs Values

Jan 13, 2025

OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems

Feb 21, 2024